CT-SPA: Text sentiment polarity prediction model using semi-automatically expanded sentiment lexicon

نویسندگان

  • Tao-Hsing Chang
  • Ming-Jhih Lin
  • Chun-Hsien Chen
  • Shao-Yu Wang
چکیده

In this study, an automatic classification method based on the sentiment polarity of text is proposed. This method uses two sentiment dictionaries from different sources: the Chinese sentiment dictionary CSWN that integrates Chinese WordNet with SentiWordNet, and the sentiment dictionary obtained from a training corpus labeled with sentiment polarities. In this study, the sentiment polarity of text is analyzed using these two dictionaries, a mixed-rule approach, and a statistics-based prediction model. The proposed method is used to analyze a test corpus provided by the Topic-Based Chinese Message Polarity Classification task of SIGHAN-8, and the F1measure value is tested at 0.62.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Expanding Opinion Lexicon with Domain Specific Opinion Words Using Semi-Supervised Approach

Opinion words as well as opinion phrases and idioms are very useful in sentiment analysis. All these terms together build opinion or sentiment lexicons. Therefore, opinion lexicons are large lists of terms that encode the sentiment of each phrase within it. Generally, to create such a lexicon automatically, high-precision classifiers use known sentiment vocabulary, e.g. the prior polarity of an...

متن کامل

Using Data Mining Techniques for Sentiment Shifter Identification

Sentiment shifters, i.e., words and expressions that can affect text polarity, play an important role in opinion mining. However, the limited ability of current automated opinion mining systems to handle shifters represents a major challenge. The majority of existing approaches rely on a manual list of shifters; few attempts have been made to automatically identify shifters in text. Most of the...

متن کامل

Analysing domain suitability of a sentiment lexicon by identifying distributionally bipolar words

Contemporary sentiment analysis approaches rely heavily on lexicon based methods. This is mainly due to their simplicity, although the best empirical results can be achieved by more complex techniques. We introduce a method to assess suitability of generic sentiment lexicons for a given domain, namely to identify frequent bigrams where a polar word switches polarity. Our bigrams are scored usin...

متن کامل

Sentiment Analysis Based on Expanded Aspect and Polarity-Ambiguous Word Lexicon

This paper focuses on the task of disambiguating polarity-ambiguous words and the task is reduced to sentiment classification of aspects, which we refer to sentiment expectation instead of semantic orientation widely used in previous researches. Polarity-ambiguous words refer to words like” large, small, high, low ”, which pose a challenging task on sentiment analysis. In order to disambiguate ...

متن کامل

Experiments on Hybrid Corpus-Based Sentiment Lexicon Acquisition

Numerous sentiment analysis applications make usage of a sentiment lexicon. In this paper we present experiments on hybrid sentiment lexicon acquisition. The approach is corpus-based and thus suitable for languages lacking general dictionarybased resources. The approach is a hybrid two-step process that combines semisupervised graph-based algorithms and supervised models. We evaluate the perfor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015